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We study online mechanisms for preemptive scheduling with deadlines, with the goal of maximizing the total value of 
completed jobs. This problem is fundamental to deadline-aware cloud scheduling, but there are strong lower bounds even for 
the algorithmic problem without incentive constraints. However, these lower bounds can be circumvented under the natural 
assumption of deadline slackness, i.e., that there is a guaranteed lower bound s > 1 on the ratio between a job’s size and the 
time window in which it can be executed. 

In this paper, we construct a truthful scheduling mechanism with a constant competitive ratio, given slackness s > 1. 
Furthermore, we show that if s is large enough then we can construct a mechanism that also satisfies a commitment property: 
it can be determined whether or not a job will finish, and the requisite payment if so, well in advance of each job’s deadline. 
This is notable because, in practice, users with strict deadlines may find it unacceptable to discover only very close to their 
deadline that their job has been rejected. 


1. INTRODUCTION 

Modern computing applications, such as search engines and big-data processing, run on large clus¬ 
ters operated by either first or third parties (a.k.a., private and public clouds, respectively). Since 
end-users do not own the compute infrastructure, the use of cloud computation necessitates crisp 
contracts between them and the cloud provider on the service terms (i.e.. Service Level Agreements 
- SLAs). The problem of designing and implementing such contracts falls within the scope of online 
mechanism design, which concerns the design of mechanisms for allocating resources when agents 
arrive and depart over time, and the mechanism must make allocation decisions online. A contract 
can be as simple as renting out a virtual machine for a certain price per hour. However, with the 
increased variety of cloud-offered services come more performance-centric contracts, such as pay¬ 
ing per n umber of transactions | Azurd 20j5 |, or a guarantee to finish executing a job by a certain 
deadline ICurino et al.ll20l4lFerguson et al.l 2012ll . 

Since the underlying physical resources are often limited, a cloud provider faces resource man¬ 
agement challenges, such as deciding which service requests to accept in view of the required SLAs, 
and determining how best to schedule or allocate resources to the different users. For instance, the 
provider may opt to delay time-insensitive tasks when usage peaks, or prevent admission of low- 
priority jobs if higher-priority jobs are expected to arrive. To make these decisions in a principled 
manner, one wishes to design a mechanism for an online scheduling problem with deadlines, aimed 
at maximizing the total value of completed jobs. This social welfare objective is particularly rel¬ 
evant in the private cloud setting. It is also relevant for markets with competition between cloud 
providers, where each provider wishes to extend its market share by increasing user satisfaction. At 
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a high level, the goal of this paper is to provide algorithmic foundations for scheduling jobs with 
different demands, values and deadlines, in a manner that is compatible with user incentives. 

The problem can be abstracted as follows. Each job request j is associated with an arrival time a 3 , 
a size (demand) Dj, a deadline d 3 and a value v 3 . There are C identical machines that can process 
jobs. Each job uses at most a single machine at a time, and jobs can be preempted and resumed. 
The goal is to maximize the total value of jobs completed by their deadlines. In a perfect world, 
a solution to this problem would achieve a good competitive ratio, would be incentive compatible, 
and would notify jobs whether or not they are completed as swiftly as possible. Unfortunately, 
the basic online scheduling problem, without considering incentives or commitments, is inherently 
difficult even when (7 = 1. From a worst-case perspec tive, there is a p olylogarithmic lower bound 
on the competitive ratio of any randomized algorithm liCanetti and Iranill998ll . However, the known 
lower bounds only apply in the presence of jobs with tight deadlines (i.e., dj = a 3 + Dj). Recent 
work circumvented the lower bound by assuming deadline slackness , where every job j satisfies 
dj — aj > s ■ Dj for a slackness parameter s > 1 llLucier et al.ll2013ll . Our aim is to continue this 
line of inquiry and design incentive compatible scheduling mechanisms in the presence of deadline 
slackness. 

Truthfulness. In our online scheduling context, the incentive compatibility requirement is multi¬ 
parameter: agents must be incentivized to report their tuple of job parameters (vj,Dj,aj,dj). As is 
standard, we assume agents cannot deviate to an arrival time earlier than a 3 , nor report a deadline 
later than dj. These assumptions are natural if one views the arrival time as the first time the cus¬ 
tomer is able to interact with the mechanism, and that job results are not released to a customer until 
the reported deadline. Furthermore, we generally assume that a job holds no value to the customer 
unless it is fully completed. Hence, a user cannot benefit from underreporting the job demand. 

Commitments. In addition to incentive compatibility, another important feature of a practical 
scheduling mechanism is commitment: whether, and when, a scheduler guarantees to complete a 
given job. Traditionally, a preemptive scheduler is allowed to accept a job, process it partially, but 
then abandon it once its deadline has passed. While this behavior may be justified in terms of pure 
optimization, in many real-life scenarios it is not acceptable, since users might be left empty-handed 
at their deadline. In reality, users with business-critical jobs require an indication, well before their 
deadline, of whether their jobs can be processed. Since susta ining deadlines is becoming a key re¬ 
quirement for modern computation clusters (e.g., ICurino et al.ll2014ll and references therein), it is 
essential that schedulers provide some degree of commitment. 

The question is: at what point of time should the scheduler commit to jobs? One option is to 
require the scheduler to commit to jobs upon arrival. Namely, once a job arrives, the scheduler 
immediately decides_whether i t accep ts the job (and then it is required to complete it) or reject 
the job. However, iLucier et al.ll20 1 3fl proved that for general values no scheduler can commit to 
jobs upon arrival while providing any performance guarantees, even assuming deadline slackness. 
Therefore, a more plausible alternative from the user perspective is to allow the committed scheduler 
to delay the decision, but only up to some predetermined point. 

Definition 1.1. A scheduling mechanism is called /3-responsive (for /3 > 0) if, for every job j, 
by time dj — /3 • Dj it either (a) rejects the job, or (b) guarantees that the job will be completed by 
its deadline and specifies the required payment. 

Note that /3-responsiveness requires deadline slackness s > (3 for feasibility. Schedulers that do 
not provide advance commitment are by default 0-responsive; we often refer to them as being non- 
committed. Useful levels of commitments are typically obtained when /? > 1, as this provides 
rejected users an opportunity to execute their job elsewhere before their deadline. 

One might consider different definitions for responsiveness in online scheduling. In a sense, the 
definition given here is additive: for each job j, the mechanism must make its decision /3Dj time 
units before the deadline. An alternative definition could be fractional: the decision must be made 
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before some fraction of job execution window, e.g., dj — u)(dj — af) for ui £ (0,1). It turns out that 
many of our resultsQ also satisfy responsiveness under this alternative definition, as well as other 
useful propertied We discuss this further in Section[6] 

1.1. Our Results 

We design the first truthful online mechanisms for preemptive scheduling with deadlines. Moreover, 
our mechanism can be made /3-responsive as defined above. 

Main Theorem (informal): For every /3 > 0, given sufficiently large slackness s > s(j3), there 
is a truthful, /3-responsive, ()(1 (-competitive mechanism for online preemptive scheduling on C 
identical servers. 

The precise competitive ratio achieved by our mechanism depends on the level of input slackness. 
We establish the main result in two steps. First, we build a mechanism that is truthful, but not com¬ 
mitted. Second, we develop a reduction from the problem of scheduling with responsive commit¬ 
ment to the problem of scheduling without commitment. Each of these two steps may be of interest 
in their own right. In particular, we obtain in the first step a truthful 0(l)-competitive mechanism 
for online preemptive scheduling with deadlines. 

THEOREM 1.2. There is a truthful mechanism for online scheduling on multiple identical 
servers that obtains a competitive ratio of 2 + © ^-^=—for any s > 1. 

Note that, as implied by known lower bounds, this competitive ratio grows without bound as 
s —> 1. However, as s grows large, the competitive ratio we achieve approaches 2. Our approach 
for this result is to begin with a greedy scheduling rule that prioritizes jobs by value density (value 
per size), then modify this scheduler so that (a) jobs are not allowed to begin executing too close 
to their deadlines, and (b) one job cannot preempt another unless its value density is sufficiently 
greater. These modifications generate incentive issues that need to be addressed with some addi¬ 
tional tweaking. We then analyze the competitive ratio of this scheduler using dual fitting techniques, 
as described in Section [231 This analysis appears in Section[3] 

For the second step, we provide a general reduction from committed scheduler design to non- 
committed scheduler design. We will describe reduction here for 3 = s/2. The idea behind the re¬ 
duction is to employ simulation: each incoming job is slightly modified and submitted to a simulator 
for the first half of its execution window. The simulator uses the given non-committed scheduling 
to “virtually” process jobs. If the simulation completes a job, then the algorithm commits to execut¬ 
ing the job on the physical server. See Section[4]for more details. This reduction can be applied to 
any scheduling algorithm, not just the truthful scheduler described abov e. Specifically, applying our 
reduction to the (non-truthful) algorithm described in iLucier et al.ll2013li generates a (non-truthful) 
committed scheduler with a competitive ratio that approaches 5 as s grows large. 

THEOREM 1.3. There is a ( s/2)-responsive scheduler for online scheduling on multiple iden¬ 
tical servers that obtains a competitive ratio of 5 + 0 ^-^==—-f © L——^ for any s > 4. 

To obtain both truthfulness and responsiveness, we wish to compose our reduction with the truth¬ 
ful non-committed mechanism described above. One challenge is that our basic reduction preserves 
truthfulness with respect to all parameters except arrival time. We can therefore immediately ob¬ 
tain a constant competitive-ratio scheduling mechanism which is (s/2)-responsive, given sufficient 
slackness; and truthful, given that jobs do not purposely delay their arrivals. For the single server 
case, we obtain the same asymptotic bound as in Theorem ll.3l for s > 4; see Section[5] 


1 Specifically, all of the results stated in Section [1711 except for Theorem ll.4l 

2 Such as the no-early-processing property: the scheduler cannot begin to process a job without committing first to its com¬ 
pletion. This implies that any job that begins processing is guaranteed to complete. 
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To yield our most general result, we explicitly construct a scheduling mechanism that obtains full 
truthfulness based on the truthful non-committed scheduler and a general reduction from committed 
scheduling to non-committed scheduling. The construction is rather technical and significantly in¬ 
creases the competitive ratio. We obtain the following result, with constants s 0 = 12 and Co = 9 for 
the single-server case, and so = 139.872 and co = 94.248 for the case of multiple identical servers. 

THEOREM 1.4. There exist constants Co and s 0 such that there is a truthful, (2s/ sfj-responsive 
mechanism for online scheduling on multiple identical servers that obtains a competitive ratio of 



1.2. Related Work 

Online preemptive scheduling models have been widely studied in the scheduling theory for various 
objectiv es, w ith value maximization results being of most relevance to our work. Canetti and Irani 
ICanetti and Iranifl998ll consider the case of tight deadlines, obtaining a deterministic lower bound 
of k and a randomized Q (-^/log re/ log log re) lower bound, where re is the max-min ratio between 
either job values or job demands. Several upper bounds have been constructed llKoren and Shashal 
1 1 992lfl9 94lICanetti and Ir anill 1 998b IPortedl2004ll . with the best being a randomized 0(log re) algo¬ 
rithm. In | Lucier et al.ll2013ll 7 we show that by incorporating a deadline slackness constraint, a non- 
committed online preemptive scheduler for the general value model exists, and prove a bouncQ of 
2 + Q( a / 4_ 1 ) + ©( ^- 37 =—fp) on i ts competitive ratio, which is constant for every s > 1. However, 
llLucier et alll2013ll do not provide any algorithmic guarantees for committed scheduling models. 
Other constant competitive schedulers have been known only for special cases. When all demands 
are ident ical, a 5-competi tive scheduler exists, which can be improved to 2 assuming a discrete 
timeline llHaiiaghavi et al.ll2005tl . Another studied model is where the value of each job equals its 
demand; this model is known as the b usy time maximization problem llDasGunta and Palisll200(Jt 
iGarav et~ak 2002; Bar-Nov et alJl!999ll . These works can be combined to obtain a 1-responsive al¬ 
gorithm with a competitive ratio of min{5.83,1 + 1/s}; however, the algorithm cannot be extended 
to incorporate general values. 

Much less is known about truthful online scheduling mechanisms. Previous works (e.g., 
llLavi and Swamvl 120071; I Archer a nd Eva Tardosl 12001111 focus mostly on offline settings with 
makespan as main objective. llJain et al.ll201 ll l2012ll design incentive compatible algorithms for 
jobs with deadlines, but restrict attention to the offline setting. Works on online truthful schedul¬ 
ing have largely focused on achiev ing the (n on-constant) boun ds from the algorithmic literature 
[Porteil2004!Haiiaghavi et al.ll2005ll . Finally, llLucier et al.l2013tl proposes a heuristic that is incen¬ 
tive compatible and 1-responsive, but no formal bounds are provided for the competitive ratio of 
that heuristic. 

2. PRELIMINARIES 

In this section we present the scheduling model and necessary definitions (Sections l2. 1 I and l2.2l i. We 
then provide a brief overview of the dual fitting technique, which is used to analyze the proposed 
mechanisms (Section [2+}. 

2.1. Scheduling Model 

We consider a system consisting of C identical servers, which are always available throughout time. 
The scheduler receives job requests over time. Denote by J the set of all job requests received by 
the scheduler. Each job request j € J is associated with a type Tj = (vj , Dj , aj ,dj). The type of 
each job j consists of the job value Vj, the job resource demand (size) Dj, the arrival time a, and 
the deadline dj. Write T as the space of possible types. We denote by pj = Vj / Dj the value-density 
of job j. The job requests in J are revealed to the scheduler only upon arrival. The scheduler can 


3 The bound presented by fLucier et all2013ll can be generalized to this form. 
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allocate resources to jobs, provided that at any point each job is processed on at most one server and 
each server is processing at most one job. Preemption is allowed. Specifically, jobs may be paused 
and resumed from the point they were preempted. If a job is allocated to servers for a total time of 
Dj during the interval [a 3 , dj], then it is completed by the scheduler. 

An instance of the scheduling problem is represented by a type profile r = {t. ; : j £ J}. Given 
a scheduling algorithm A, denote by A(t) the jobs that are fully completed by A on an instance 
r, and by v{A{t)) their aggregate value. The goal of the scheduler is to maximize u(.4(t)). Let 
OPT denote the optimal offline algorithm. The quality of an online scheduler is measured by its 
competitive ratio, which is the worst case ratio between the optimal offline value and the value 
gained by the algorithm. In this paper, we define the competitive ratio as a function of the input 
slackness, defined s = s(r) = min { dl D | Tj = ( Vj,Dj,aj,dj) £ r}. The competitive ratio of 
an online algorithm A on inputs with slackness s, denoted cr_ 4 (s), is given by: 


c Ta(s) 


max 

t:s(t)=s 


v{OPT(t)) \ 
v{A{t)) J 


e [1,00). 


( 1 ) 


The following definitions refer to the execution of an online allocation algorithm A over an in¬ 
stance r. We drop A and r from notation when they are clear from context. Time is represented by 
a continuous variable t. For a scheduling algorithm A, denote by j\{t) the job running on server i 
at time t and by p\(t) its value-density. We use y){t) as a binar^Q variable indicating whether job j 
is running on server i at time t, i.e., whether j = j\(t) or not. We often refer to the function y' as 
the allocation of job j on server i, and to y 3 as the allocation of job j. 


2 . 2 . Mechanisms and Incentives 

Each job in J is owned by a rational agent (i.e., user), who submits it to the scheduling mechanism. 
We will be studying direct revelation mechanisms, where each user participates by announcing its 
type Tj = ( Vj,Dj,a,j,dj ) from the space T of possible types. A mechanism then consists of an 
allocation rule A : T^ —> {0, 1} J and a payment rule p : T^ —y R^. Writing A(t) as the profile 
of allocations returned by the mechanism given type profile r, we interpret Aj (r) as an indicator 
for whether the job of customer j is fully completed by its deadline. In general mechanisms can 
be randomized, in which case we can interpret Aj (t) £ [0,1] as the expected allocation of agent 
j. However, all of the mechanisms we consider in this paper are deterministic. We will restrict our 
attention to online mechanisms, which are constrained to make scheduling decisions at each point 
in time without knowledge of jobs that arrive at future times. Agents have quasilinear utilities: given 
allocations x and payments p, the utility of user j is given by u 3 (t) = VjAj(r) — Pj(r). 

We adopt a model in which we only allow late reports of arrivals, early reports of deadlines, and 
increased reports of job lengths. As discussed in the introduction, this assumption is justifiable in the 
context of allocating cloud resources. We say a mechanism is truthful if, subject to these restrictions 
on type reports, each user j maximizes expected utility by reporting his true type to the mechanism, 
for any possible declarations of the other agents. _ 

We will make heavy use of a characterization of truthfulness made by IlHaiiaghavi et al.ll2005ll . 
We say that a type t 3 = (vj,Dj,a 3 ,dj) dominates rj = (v'j, Dj , a', d') if Vj > v ' 3 , D 3 < Dj, 
aj > a'j, and dj < d'j. We then say that an algorithm A is monotone if for any type profile r, 
any j, and any rj that dominates Tj, we have that Aj{jj,T_j) < Aj{r'j,T-j). For deterministic 
algorithms, this means that if job j is allocated under input profile r, then it will also be allocated if 
customer i’s report changes from t 3 to a type that dominates t 3 . 

Theorem 2.1 (iThajiagha yi et al ](2005h). Given an allocation algorithm A there exists 
a payment rule p such that mechanism ( A, p) is truthful if and only if A is monotone. 


4 In Section lT3l we extend the range of values if (t) may receive. However, we will always treat it as an allocation indicator. 
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2.3. LP and Dual Fitting 

Our competitive ratio analysis relies on a relaxed formu latio n of the problem as a linear program 
(LP). The relaxed LP formulation was suggested in I Jain et al.l 1201 il l and considered later in 
Uain et al~1l2012l:lLucier et al.ll2013ll . In this paper, we do not require the LP formulation itself, but 
do rely on its dual. For completeness, we present below both the primal and dual programs. The 
primal program holds a variable representing the allocation of a job j & J on server i at time 
t £ [a,j , dj\. 


Primal Program. 


max 


c 


YY pjyW) dt 


d-j 


Y / yj(t) dt < d j 


Y yj-w ^ 1 

j:te[aj,dj] 


c c 3 

Yy^-Y-Y IyW dt ^ 0 


1=1 *= i c 

y)(t) > o 


Vj 


Vz,f 


Vjj t £ [o-j j dj ] 


Vj,i,te [dj,dj\ 


( 2 ) 

(3) 

(4) 

(5) 


The first two sets of constraints ([3},© are sta ndard demand and capacity constraints. The constraints 
(0) are gap-reducing constraints; see l Jain et al.l201 Hi for an interpretation of these constraints. Note 
that for the single server case, the constraints (0 are redundant, since they follow from ©. The 
primal objective 0 is to maximize the total (fractional) value. 

The dual linear program of an instance r is given as follows. 


Dual Program. 

C °° 

Y. D i a j + Y, [ foitydt (6) 

j&J i=l o 

dj 

otj + Pi(t) + 7 Tj(t) - Y J nj(t')dt' > Pj Vj [a,j,dj] (7) 

a j 

Otj, 7Tj(t) >0 Vj £j, i,t£ [aj , dj] (8) 

We provide the intuition behind the dual formulation. The dual program holds a constraint 0 for 
every tuple (j, i, t), where j is an input job, i is a server index, and t £ [aj,dj\ is a specific time. 
Note that since time is continuous, there are an infinite number of constraints. However, this does 
not impose an issue, since we do not solve the dual program explicitly. There are three types of dual 
variables. We typically set nj(t) = 0, since these variables are not required throughout this paper. 
The second variable aj is associated with each job j and appears in all of the constraints of job j. 
Setting aj = pj allows us to satisfy all of the constraints associated with job j. As a result, the 


min 


s.t. 
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dual objective function <( 6 | increases by IJ 3 o 3 = D 3 p 3 = v 3 . The n 3 variables are typically used 
to cover all the constraints of a completed job j, since the cost of covering their constraints is equal 
to their value. The last variables appear in all constraints associated with a server i and time 
t. These variables are typically used to cover the dual constraints associated with incomplete jobs, 
since these variables are shared across the constraints of all jobs. 

We denote by OPT* (r) the optimal fractional solution of the dual program for an instance r. 
Define IG(s) = max T:g ( T ) =s {v(OPT* (t))/v(OPT(t))} as the integrality gap for instances with 
slackness s. We are interested in online scheduling algorithms that induce upper bounds on the 
integrality gap. 

Definition 2.2. An online scheduling algorithm A induces an upper bound on the integrality gap 
for a given slackness s if IG(s) < cr^(s). 

The dual fitting technique bounds both the competitive ratio cr_/i(s) of an online algorithm A and 
the integrality gap IG(s) by constructing a feasible solution to the dual program and bounding its 
dual cost. Every feasible dual solution induces an upper bound on the optimal fractional solution, 
and the well-known weak duality theorem implies that v(OPT(t )) < v(OPT*(t)). Moreover, 
v(A (t)) < v{OPT[t)). Therefore, we can obtain bounds on the integrality gap and the competitive 
ratio of A. This is summarized in the following theorem. 

THEOREM 2.3 (Dual Fitting I VAZIRA N fcOOlll l. Let Abe an online scheduling algorithm. 
If for every instance r with slackness s = s(r) there exists a feasible dual solution (a, /3, 7 r) with a 
dual cost of at most r(s) ■ v(A(t)), then cva(s) < r(s)andlG(s) < r(s). 

3. TRUTHFUL NON-COMMITTED SCHEDULING 

Our first goal is to design a truthful online scheduling mechanism under the deadline slackness 
assum ption, without regar d for commitments. The algorithmic version of this problem was stud¬ 
ied in llLucier et al.ll2013li . llLucier et al.ll2013il presents a modified greedy scheduling algorithm, 
and shows tha t it o btains a constant competitive ratio for any s > 1. However, the algorithm in 
iLucier et al.ll2013|] is not monotone. We refer the reader to the full version of the paper for a coun¬ 
terexample, in which a job that would not be completed can manipulate the algorithm by reporting 
a lower value and consequently be completed by its deadline. 

In this section, we develop a new truthful mechanism At, which also obtains a constant competi¬ 
tive ratio for any s > 1. The mechanism will be parameterized by constants 7 > 1 and fi > 1, which 
will be specified below. A key element in At is dividing the jobs into buckets (classes), differenti¬ 
ated by their value densities. Precisely, the job classes are Ct = {j | p 3 £ [ 7 ^, 7 ^ +1 ) }■ Notice that 
job j belongs to class Cg for I = [log^, (p^ )J. We think of a job j' as dominating another job j if f 
is in a “higher” bucket than j. More formally, we use the following notation throughout the section: 

Definition 3.1. Given jobs j and j', we say that j' y j if |k>g 7 (Pi')J > |_log 7 (pj )J. 

At a high level, algorithm At proceeds as follows. At each point in time. At will process the job 
with highest priority according to the ordering That is, a pending job j' can preempt a running job 
j only if j ' >~ j ■ However, there is an important exception: if a job j has not begun its execution by 
time d 3 — pDj, then the scheduler will discard that job and will not schedule it thereafter (i.e., it can 
be rejected immediately). The following intuition motivates these principles. The preemption rule 
guarantees that the running jobs belong to the highest classes out of all available jobs (proven later, 
see Claim |3T2| ). This prevents users from benefiting from a misreport of their values. The decision 
to not execute a job that has not begun by time d 3 — p l) 3 is used to bound the competitive ratio; 
note that this condition implies that there is slackness in the time interval from the first time the job 
is executed, to the job’s deadline. 

We now formally describe our truthful algorithm for the single server case (see Algorithm [j] for 
pseudo-code). The extension to multiple servers can be found in the full version of the paper. 
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ALGORITHM 1: Truthful Non-Committed Algorithm At for a Single Server 
Vf, J p if) — {j £ J \ j partially processed by At at time t A f £ [aj, dj ]}. 
J E (t) = {j £ J | j unallocated by At at time t A t £ [ay, dj — pDj]}. 

Event: On arrival of job j at time t = a.j\ 

1. call ClassPreemptionRule(f). 

Event: On completion of job j at time t: 

1. resume execution of job j' = arg max { pj/ \ j' £ J P ( t )}. 

2. call ClassPreemptionRule(f). 

3. delay the output response of j until time dj. 

ClassPreemptionRule (f): 

1. j <— job currently being processed. 

2. j* £- arg max {pj* | j* £ J E (t)}. 

3- if O'* Y j) : 

3.1. preempt j and run j*. 


Note that the algorithm maintains two job sets. The first set J p (t) represents jobs j that have 
been partially processed by time t and can still be executed. The second set J E (t ) represents all 
jobs j that have not been allocated by time t, where t < dj — // Dj. 

The algorithm’s decisions are triggered by one of the following two events: either when a new 
job arrives, or when a processed job is completed. The algorithm handles both events similarly. 
When a new job j arrives, the algorithm invokes a class preemption rule, which decides which job 
to process. In this case, the arriving job j preempts the running job only if it belongs to a higher 
class. The second type of event occurs when the running job is completed. As mentioned earlier, the 
algorithm delays the output of the job until its respective deadline (line 3). When a job is completed, 
the algorithm resumes the best job j' among the preempted jobs in J p it) (line 1) and calls the class 
preemption rule (line 2). The class preemption rule would override the decision to resume j’ if there 
exists an unallocated job j* in J E (t ) belonging to a higher class. In that case, j* is processed and f 
remains preempted. Notice that in both cases, the algorithm favors jobs belonging to higher classes. 
Formally, 

Claim 3.2. Let j = jA T (t) b e the job processed at time t by At- Let j ' £ J p {t) U J E {t). 
That is, j has either been allocated by time t and t £ [dj’, dj>], or j has not been allocated by time 
t and t £ [ a,j< , dj’ — pDj\. Then, j' )/- j. 

PROOF. Assume towards contradiction that f >- j- Let t* denote the earliest time job inside the 
interval [dj/, t] during which j is allocated. Note that t* must exist, since the claim assumes that j 
has being processed at time t. At time i*, the algorithm A either started processing j or resumed 
the execution of j. For A to start j, the threshold preemption rule must have preferred j over j', 
which is impossible. The second case where A resumed the execution of job j is also impossible, 
since either j' would have been resumed instead of j, or the threshold preemption rule would have 
immediately preempted j. We conclude that j' )/- j. □ 

Claim [ 3 T 2 I implies that at any point in time, the job allocated by _4-f belongs to the highest class 
among the jobs that can be processed, i.e., either an unallocated job j such that t £ [ a,j,dj — pDj] 
or a partially processed job j such that t £ [a,j, dj}. Notice further that equalities in job classes 
are broken in favor of partially processed jobs. This feature is crucial for proving the truthfulness 
and the performance guarantees of our algorithm. Using Claim lT2l we prove an additional property, 
which is also required for establishing truthfulness. 

CLAIM 3.3. At any time t, the set J p (t) contains at most one job from each class. 
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Proof. By induction. Assume the claim holds and consider one of the possible events. Upon 
arrival of a new job j* at time t, the threshold preemption rule allocates j* only if j* >~ j. Since j is 
the maximal job in J p (t), with respect to >-, if j* is allocated then it is the single job in J p (t ) from 
its class. Upon completion of job j, it is removed from J p (t) and the threshold preemption rule is 
invoked. As before, if a new job is allocated, it belongs to a unique class. □ 

We now prove that At is truthful, i.e.. At can be used to design a truthful online scheduling 
mechanism. 

CLAIM 3.4. The algorithm At (single server) is monotone. 

The full proof of Claim [3~4l appears in Annendix lB.21 The intuition behind the result is as follows. 
The algorithm is defined so that the processing of higher-class jobs is independent of the presence of 
lower-class jobs in the system. As a result, a job j is completed if precisely two conditions hold: first, 
that there is some time in [aj, dj — pDj] in which no job of equal or higher class is executing (so that 
job j can start), and second, there are at least Dj units of time after the earliest such start time, but 
before dj, in which higher class jobs are not executing. These conditions are well-defined because 
the processing of job j does not impact the times in which jobs of higher class are processed. One 
can then note, however, that each of these two conditions are monotone with respect to the job’s 
class, length, arrival time, and deadline. One can therefore conclude that the algorithm is monotone, 
and hence truthfulness follows from Theorem l2.ll 

The competitiv e-ratio analysis of At is similar to the analysis of the non-truthful algorithm A 
jLucier et al.ll2013tl . and proceeds via the dual fitting methodology. The full proof is described in 
Appendix lB.2l Our result is the following. 

THEOREM 3.5. The mechanism At (single-server) is truthful and obtains a competitive ratio 

cr^w = 2 + e(^A_)+e(^=k_), <>i. 

3.1. Extension to Multiple Servers 

We next extend our algorithm to handle multiple servers. We provide a high level description of the 
algorithm; the details can be found in Appendix lB.3l The multiple server algorithm runs a local copy 
of the single server algorithm on each of the C servers. The algorithm allows a job to use different 
servers throughout time (equivalently, we use say that a job is allowed to migrate between servers), 
yet with some restrictions: a preempted job j can migrate to any other server before time dj — pDj. 
After that time, the job may only use the subset of servers which were allocated to it before time 
dj — pDj. We obtain the following competitive-ratio result. 

Theorem 3.6. The algorithm At (multiple-servers) obtains a competitive ratio of: 

«*.(.) = 2 + e( ? i_) + e(^=L_), S > L 

Observe that the competitive ratio for the multiple server case is (asymptotically) identical to the 
bound obtained for a single server. However, we note that the constants hidden inside 0 are slightly 
larger for the multiple-server case. 

4. COMMITTED SCHEDULING 

In this section we develop the first committed (i.e., responsive) scheduler for online scheduling 
with general job types, assuming deadline slackness. Our solution is based on a novel reduction 
of the problem to the “familiar territory” of non-committed scheduling. We introduce a parameter 
u € (0,1) that affects the time by which the scheduler commits. Specifically, the scheduler we 
propose decides whether to admit jobs during the first (1 — w)-fraction of their availability window, 
i.e., by time dj — u(dj — aj) for each job j. The deadline slackness assumption (dj — aj > sDj ) 
then implies that our scheduler is (ws)-responsive (cf. Definition ! 1.1 1 for /3 = us). 
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We start with the single server case (Section ldTl) . where we highlight the main mechanism design 
principles. We then extend our solution to accommodate multiple servers, which requires some 
subtle changes in our proof methodology (Section l4~2l >. 

Our competitive-ratio results hold for slackness values greater than some threshold (e.g., s > 4 
for the single-server case). In Section POI we provide an indication that high slackness is indeed 
required, by obtaining a related impossibility result for inputs with small slackness. 

4.1. Reduction for a Single Server 

Our reduction consists of two key components: (1) simulator : a virtual server used to simulate an 
execution of a non-committed algorithm _4; and (2) server: the real server used to process jobs. 
The speeds of the simulator and server are the same. We emphasize that the simulator does not 
utilize actual job resources. It is only used to determine which jobs to admit. We use the simulator 
to simulate an execution of the non-committed algorithm. Upon arrival of a new job, we submit the 
job to the simulator with a virtual type, defined below. If a job is completed on the simulator, then 
the committed scheduler admits it to the system and processes it on the server (physical machine). 
We argue later that the overall value gained by the algorithm is relatively high, compared to the 
value guaranteed by A. 

We pause briefly to highlight the challenges in such simulation-based approach. The underlying 
idea is to admit and process jobs on the server only after they are “virtually” completed by A on 
the simulator. If the simulator completes all jobs near their actual deadlines, the scheduler might not 
be able to meet its commitments. This motivates us to restrict the latest time in which a job can be 
admitted. The challenge is to guarantee that all admitted jobs are completed, while still guaranteeing 
relatively high value. 

We now provide more details on how the simulator and server are handled by the committed 
scheduler throughout execution. 

Simulator. The simulator runs an online non-committed scheduling algorithm A. Every arriving 
job j is automatically sent to the simulator with a virtual type t'-” ' 1 = ( Vj , Dj U \ a,j , d^), where 

= dj—co(dj —a,j) is the virtual deadline of j, andD^ = Dj/u> is the virtual demand of j. If A 
completes the virtual request of job j by its virtual deadline, then j is admitted and sent to the server. 

Server. The server receives admitted jobs once they have been completed by the simulator, and 
processes them according to the Earliest Deadline First (EDF) allocation rule. That is, at any time 
t the server processes the job with the earliest deadline out of all admitted jobs that have not been 
completed. 

The reduction effectively splits the availability window to two subintervals. The first (1 — w) 
fraction is the first subinterval and the remainder is the second. The virtual deadline d.^’ 1 serves 
as the breakpoint between the two intervals. During the first subinterval, the algorithm uses the 
simulator to decide whether to admit j or not. Then, at time d^ v \ it communicates the decision 
to the job. In practical settings, this may allow a rejected job to seek other processing alternatives 
during the remainder of the time. Furthermore, if j is admitted, the scheduler is left with at least 
ui(dj — dj) time to process the admitted job on the server. 

The virtual demand of each job j is increased to Dj/ui. We use this in our analysis to guarantee 
that the server meets the deadlines of admitted jobs. Note that we must require Dj/ut < (1 — oj)sDj, 
otherwise j could not be completed on the simulator. By rearranging terms, we get a constraint on 
the values of s for which our algorithm is feasible: s > . 

4.1.1. Correctness. We now prove that when the reduction is applied, each accepted job is guar¬ 
anteed to finish by its deadline. Note that the simulator can complete a job before its virtual deadline, 
hence it may be admitted earlier. However, in the analysis below, we assume without loss of gener- 
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ality that jobs are admitted at their virtual deadline. Accordingly, We define the admitted type of job 

j as rj o) = {vj,Dj,d { j\dj). 

Recall that Ac(t) represents the jobs completed by the committed algorithm. Equivalently, these 
are the jobs completed by the non-committed algorithm A on the simulator. To prove that Ac can 
meet its guarantees, we must show that the EDF rule deployed by the server completes all jobs in 
Ac(t), when submitted with their admitted types. It is well known that for every set of jobs S, 
if S can be feasibly allocated on a single server (i.e., before their deadline), then EDF produces a 
feasible schedule of S. Hence, it suffices to prove that there exists a feasible schedule of Ac(t). We 
prove the following general claim, which implies the correctness of our algorithm. 


THEOREM 4.1. Let S be a set of jobs. For each job j £ S, define the virtual deadline of j as 
= dj — u>(dj — a,j). If there exists a feasible schedule of S on a single sen’er with respect to 
the virtual types = (vj,Dj/u},aj,d^^ for each j £ S, then there exists a feasible schedule of 
S on a single sen’er with respect to the admitted types rj°^ = {yj, Dj, d^ !> , dj ) for each j £ S. 


Proof. We describe an allocation algorithm that generates a feasible schedule of S with respect 
to admitted types. That is, the algorithm produces a schedule where a each job j £ S is processed 
for Dj time units inside the time interval [d^ v \dj\. The algorithm we describe allocates jobs in 

decreasing order of their virtual deadlines. For two jobs j, j ' £ S, we write j' >- j when d <r> > 

In each iteration, the algorithm considers some job j £ S by the order induced by >-, breaking ties 
arbitrarily. We say that time t is used when considering j if the algorithm has allocated some job j' 
at time t ; otherwise, we say that t is free. We denote by U t and F, the set of used and free times 
when the algorithm considers j, respectively. The algorithm works as follows. Consider an initially 
empty schedule. We iterate over jobs in S in decreasing order of their virtual deadlines, breaking 
ties arbitrarily; this order is induced by >-. Each job j in this order is allocated during the latest 
possible Dj free time units. Formally, define t' = argmaxjf : |[i, dj] fl Fj\ = Dj} as the latest 
time such that there are exactly Dj free time units during \t'. dj\. The algorithm allocates j during 
those free Dj time units ]t', dj] fl Fj. 

We now prove that the algorithm returns a feasible schedule of S, with respect to the admitted job 
types. It is enough to show that when a job j £ S is considered by the algorithm, there is enough 
free time to process it; namely, there should be at least Dj free time units during [d'A 1 , dj]. Consider 
the point where the algorithm allocates a job j £ S. Define £r = max{f | [dj, dj + l] C Uj] and 
denote tR = dj + £r. By definition, the time interval [dj , £#] is the longest continuous block that 
starts at dj in which all times t £ [dj, tn\ are used. Define t/, = a 7 — £r ■ (1 — ui)/u>. We claim 

that any job f >- j allocated in the interval [df:-' !> , Ir] must satisfy [aj>,dj>] C \t L , tpj]. Assume the 
claim holds. We show how the claim leads to the theorem. Denote by Jrr all jobs j' >- j that have 
been allocated sometime during the interval [dfj V \ £#]. Obviously, we also have [aj,dj] C [tL,tn\. 
Now, since we know there exists a feasible schedule of S with respect to the virtual types, we can 
conclude that the total virtual demand of jobs in d ir U {j} is at most Ir — t.R, since the interval 

[tc, £r] contains the availability windows of all these jobs. Notice that tn — tL = (tn — d^)/u. 
Since the virtual demand is 1 /u> times larger than the admitted demand, we can conclude that the 
total amount of used time slots during [d^An] is at most (tjj — d^’’ 1 ) — Dj. Thus, there have to be 

Dj free time units during [d-p , dj] since [dj,t}f\ is completely full. It remains to prove the claim. 
Let j' £ Jlr- Notice that dj> < tR\ otherwise, the allocation algorithm could have allocated j' 
after time tR, and since we assume j' has been allocated sometime between [d^ v \ dj], this would 
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which completes the proof. □ 
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4.1.2. Competitive Ratio. We now analyze the competitive ratio obtained via the single server 
reduction. The competitive ratio is bounded using dual fitting arguments. Specifically, for every 
instance r with slackness s = s(r), we construct a feasible dual solution (a,/3) with dual cost 
proportional to v{Ac(j )), the total value gained by Ac on r. Recall the dual constraints (f7]i cor¬ 
responding to types Tj = (vj. Dj,aj,dp. For the single server case, we make two simplifications. 
First, we denote /3(f) = /3i(f) to simplify notation. Second, we assume that 7r = 0 without loss of 
general it\Q. The dual constraints corresponding to r reduce to: 

ay +/3(f) > Pj Vj €j,t€[aj,dj\. (9) 

Our goal is to construct a dual solution which satisfies © and has a dual cost of at most r-v(Ac(T)) 
for some r. Note that v(Ac(t)) = u(^1(t^^)). To do so, we transform a dual solution correspond¬ 
ing to virtual types r^' to a dual solution satisfying ({h)i. The dual constraints corresponding to the 
virtual types are: 

otj -F /3(f) > upj Vj e J, t e [a,j,dp ] ] (10) 

Assume that the non-committed algorithm A induces an upper bound on IG(s^), where d'"' 1 = 
s ■ u){\ — u>) is the slackness of the virtual types A v ), This implies that the optimal dual solution 
(a*,/3*) satisfying ( flOb has a dual cost of at most cr^s*^) • v(A(t^)) = cr a{s ( v) ) ■ v(A c (t)). 
Yet, (ct*,/3*) satisfies (flOb . while we require a solution that satisfies (0. To construct a feasible dual 
solution corresponding to the original job types r, we perform two transformations on (a*,/3*) 
called stretching and resizing. 

LEMMA 4.2 (Resizing Lemma). Let (a, /3) be a feasible solution for the dual program cor¬ 
responding to a type profile Tj = (yj, Dj , a,j, dj). There exists a feasible solution (a/, ff ) for the 
dual program with demands D'- = / • Dj for some / > 0, with a dual cost of: 

OO CO 

y -D'a' + f P' (t)dt = ^DjCtj + j- j j3(t)dt. 

{ jej 1 { 


5 This assumption is valid due to the redundancy of the primal constraints corresponding to 7r for a single server. 
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PROOF. Notice that the value density corresponding to £)'=/• Dj is p' = Pj/ f. Hence, by 
setting a'j = ctj/ f for every job j £ J and /3(t) = /3(f)// for every time t, we obtain a feasible 
dual solution corresponding to resized demands D'. The dual cost is as stated since I)' ] o', = Dj o :j 
for every job j. □ 

Lemma 4.3 (Stretching Lemma, IILucier et al .11201311 / Let (a,/3) be a feasible solu¬ 
tion for the dual program corresponding to a type profile Tj = (vj,Dj,aj,dj}. There exists a 
feasible solution (a r , 0) for the dual program with deadlines dj = dj + / • (dj — aj) for some f, 
with a dual cost of: 

OO OO 

y: Dja'j + f 0{t)dt = DjOtj + (1 + /) • f (3(t)dt. 

j£J { jeJ o 

These two lemmas allow us to bound the competitive ratio of Ac ■ 


THEOREM 4.4. Let A be a single server scheduling algorithm that induces an upper bound 
on the integrality gap IG (s^)for = s ■ w(l — u>) and to € (0,1). Let Ac be the committed 

algorithm obtained by the single server reduction. Then Ac is ujs-responsive and 


cr A c ( s ) < 


cr_4^s • w(l — 
w(l — w) 


s > 


1 

w(l — Ul) 


PROOF. We first prove that the scheduler is ws-responsive. Note that each job j is either com¬ 
mitted or rejected by its virtual deadline d!;-’ 1 = dj —ui(dj — aj). The deadline slackness assumption 
states that dj — aj > sDj for every job j. Hence, each job is notified by time dj — uisDj, as required. 

We now bound the competitive ratio. Consider an input instance r and denote its slackness by 
s = s(t ). Let t^ v> denote the virtual types corresponding to r, and let s ^ = s ■ u>( 1 — ui) denote 
their slackness. We prove the theorem by constructing a feasible dual solution (a, 0) satisfying 
and bounding its total cost. By the assumption on A, the optimal fractional solution (a*,/3*) 
corresponding to has a dual cost of at most cr_ 4 (s(G ) • u(^1(t^^)) = cr^s^) • v(Ac(t)). 
We transform (a*,/3*) into a feasible solution (a, 0) corresponding to r by applying the resizing 
lemma and the stretching lemma, as follows. 

— We first apply the resizing lemma for / = — to cover the increased job demands during simulation. 
The dual cost increases by a multiplicative factor of /. 

— We then apply the stretching lemma to cover the remaining constraints; that is, the times in the 
jobs’ execution windows not covered by the execution windows of the virtual types. We choose 
/ such that dj = d!- 1 ' 1 + f ■ (d!'p — aj ); hence, / = As a result, the competitive ratio is 
multiplied by an additional factor of 1 + / = y/—. 

After applying both lemmas, we obtain a feasible dual solution that satisfies the dual constraints ®. 
The dual cost of the solution is at most • cr^ (s • w(l — w)) • v(Ac(f)). The theorem follows 

through the correctness of the dual fitting technique. Theorem 12. 3 1 □ 

Applying Theorem l4.4l to the single server scheduling algorithm At from Section[3]and choosing 
oj = 1/2, one obtains a (s/2)-responsive scheduler with a competitive ratio that approaches 8 as s 
grows large. However, we note that a more careful analysis, specific to the algorithm At, leads to 
an improved bound (approaching 5 as s grows large). This tighter analysis, which involves merging 
the dual-fitting techniques from Theorem 14.41 with the dual-fitting techniques used to bound the 
competitive ratio of At, is described in Appendix[E] 
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4.2. Reductions for Multiple Servers 

We extend our single server reduction to incorporate multiple servers. We distinguish between two 
cases based on the following definition. 

Definition 4.5. A scheduler is called non-migratory if it does not allow preempted jobs to re¬ 
sume their execution on different servers. That is, a job is allocated at most one server throughout 
its execution. 


Constan t-competitiv e non-migratory schedulers are known to exist in the presence of deadline 
slackness IlLucier et al.ll2013ll . Given such a scheduler, we can easily construct a committed algo¬ 
rithm for multiple servers by extending the single server reduction; see full paper for details. How¬ 
ever, we do not know how to use this reduction to obtain a committed scheduler which is truthful, 
since it requires that the non-committed scheduler is both truthful and non-migratory; unfortunately, 
we are not aware of such schedulers. 

Therefore, we construct below a second reduction, which does not require a non-migratory non- 
committed scheduler. This is essential for Section 0 where we design a truthful committed sched¬ 
uler. We note that the first reduction leads to better competitive-ratio guarantees, hence should be 
preferred in domains where users are not strategic. 

4.2.1. Non-Migratory Case. In the following, let Abe a non-committed scheduler for multiple 
servers which is non-migratory. We extend our single server reduction to obtain a committed sched¬ 
uler Ac for multiple servers. The reduction remains essentially the same: the simulator runs the 
non-committed scheduler on a system with C virtual servers. When a job is completed on virtual 
server i, it is admitted and processed on server i. Each server runs the EDF rule on the jobs admitted 
to it. To prove correctness (i.e., the scheduler meets all commitments), we simply apply Theorem 
14. li on each server independently. The bound on the competitive ratio obtained in Theorem l4.4l can 
be extended directly to the non-migratory model. 


COROLLARY 4.6. Let Abe a multiple server, non-migratory scheduling algorithm that induces 
an upper bound on the integrality gap IG(s^) for s^ = s ■ to(l — to) and ui € (0,1). Let Ac be 
the committed algorithm obtained by the multiple server reduction for non-migratory schedulers. 
Then Ac is cos-responsive and 


ciU c (s) < 


cr_4^s • u>( 1 — to^j 
co( 1 — to) 


s > 


1 


to( 1 — to) 


Applying Corollarv l4.6l to the non-migratory multiple-server algorithm presented in IlLucier et al.l 
12013l l and setting to = 1/2, one obtains a (s/2)-responsive scheduling algorithm for multiple 

servers with competitive ratio 8 + 0^-^==—^ + 0^^-—=—-^. As in Theorem 14.41 one can 

achieve a tighter approximation factor of 5 + 0 + 0 using the details of 

the dual-fitting analysis from iLucier et al.ll2013ll . This gives the result described in Section [Tj] as 
Theorem ll.3l The details of this improved analysis appear in Appendix[E] 


4.2.2. Migratory Case. We now assume that A allows migrations. This will be important for truth¬ 
ful committed scheduling, explored in the next section. Unfortunately, the reduction proposed for 
the non-migratory case does not work here. We explain why: consider some job j that is admitted 
after being completed on the simulator; note that j may have been processed on more than one 
virtual server. Our goal is to process j by ti me dp Assume each server runs the EDF rule on the 
jobs assigned to it, as suggested in Section 14.2.11 Since j has been processed on more than one 
virtual server, it is unclear how to assign j to a server in a way that guarantees the completion of 
all admitted jobs. One might suggest to assign each server i the portion of j that was processed on 


EC’15, June 15-19, 2015, Portland, OR, USA, Vol. X, No. X, Article X, Publication date: February 2015. 


















Truthful Online Scheduling with Commitments 


X:15 


virtual server i. However, this does not necessarily generate a legal schedule. If each server runs 
EDF independently, a job might be allocated simultaneously on more than one_server. 

We propose the following modifications. First, we will use a result of iiChan et al.ll200-5 tl. which 
shows that any set S of jobs that can be scheduled with migration on C servers can also be sched¬ 
uled without migration on C servers with a speedup of (3 + 2\/2) m 5.828. Thus, if we increase 
the virtual demand of the jobs submitted to the simulator by this amount, then it will be possible to 
modify the resulting migratory schedule to be non-migratory. Next, instead of running the EDF rule 
on each server independently, we run a global EDF rule. That is, at each time t the system processes 
the (at most) C admitted jobs with earliest deadlines. This is known as the EDF rule for multiple 
servers (also known as f-EDF |Funk;i2004|). It is well known that the EDF rule is not optimal on 
multiple servers; formally, for a set S of jobs that can be feasibly scheduled o n C se rve rs w ith mi¬ 
gration, EDF does not necessarily produce a feasible schedule on input S l lHong and Leundfl 989(1 . 
Neverthele ss, it i s known that EDF produces a feasible schedule of S when the servers are twice 
as fast ilPhillips et al.lll997li . Thus, since server speedup is directly linked with demand inflation, if 
we double the virtual demand of the jobs submitted to the simulator, we are guaranteed that EDF 
would produce a feasible schedule for the admitted jobs. We will therefore modify the virtual de¬ 
mand of each job submitted to the simulator. The virtual demand of each job j will be increased to 
2(3 + 2 s/2)-Dj/u. The additional factor of 3 + 2v / 2 ~ 5.828 is necessary for correctness, which 
is established in the following theorem. 


THEOREM 4.7. Let Abe a multiple server scheduling algorithm that induces an upper bound 
on the integrality gap IG (s^)for s^ v ' > = s ■ w(l — co) and co £ (0,1). Let Ac be the committed 
algorithm Ac obtained by the multiple server reduction. Then Ac is cos-responsive and 


cr A c ( s ) < 


11.656 / 

—r. -- • cr A s 

w(l — co) \ 


co{l — co) \ 

11.656 J 


11.656 
co( 1 — co) 


PROOF. Fet S denote the set of jobs admitted by the committed algorithm Ac on an instance r. 
To prove correctness, we must show that there exists a feasible schedule in which each job j £ S is 
allocated 2 Dj demand during [d ^' 1 , dj]. If so, then ilPhillips et al.tll997ll implies that EDF completes 
all admitted jobs by their deadline. This follows since: 


(1) There exists a feasible schedule of S with types (vj, 11 ^ 56 • D 3 , cij , ) on C servers with 

migrati on. This is the “simulator” schedule produced by the non-committed algorithm A. 

(2) IChan et al.l 12005 1 proved that any set S of jobs that can be scheduled with migration on C 
servers can also be scheduled without migration on C servers with 5.828-speedup. As a result, 
there exists a feasible non-migratory schedule of S with types (vj ,^-Dj, a,j , d^' 1 ) on C servers. 

(3) By applying Theorem l4.1l on each server separately, we obtain a feasible non-migratory sched¬ 
ule of S with types ( Vj , 2 Dj, d^ v \ dj) on C servers, as desired. 

(4) Therefore, EDF produces a feasible schedule of the admitted jobs S with types 

{vj, D j,d^ ] ,dj). 

We note that step 4 (i.e., using EDF) is necessary. Even though Steps 2 and 3 establishe that feasible 
non-migratory schedules of S exist, they cannot necessarily be generated online, unlike EDF. The 
competitive ratio can be bounded by following the same steps as in the single server case (Theorem 
14.4I >. however the resizing lemma must be applied with / = 11.656w. Finally, note that the slackness 
s must satisfy s(l co) > 11.656/w, otherwise jobs could not be completed on the simulator. □ 

We use this reduction in Section[5]to design a truthful committed scheduler for multiple servers. 


4.3. Impossibility Result 

The committed schedulers we construct guarantee a constant competitive ratio, provided that the 
deadline slackness s is sufficiently large. For example, s has to be at least (w(l—w)) _1 for the single 
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server case, implying that s > 4 (since u> = 1/2 minimizes the expression). A valid question is 
whether these conditions on s are merely a consequence of our choice of construction, or an inherent 
property of any possible committed scheduler. In this subsection, we provide some indication that 
the latter is more likely, by provider an impossibility result. In particular, we prove a lower bound for 
committed schedulers that satisfy an additional requirement, termed no early processing. A no early 
processing scheduler is a scheduler that may not process jobs before committing to their execution. 
We note that the schedulers we have designed in this section satisfy this requirement. It is also 
worth mentioning that although we did not include no-early processing as part of our ^-responsive 
commitment definition, this is a natural property to require in many practical settings; e.g., when 
there is a cost (of data transmission, etc.) associated with beginning the execution of a job. Our 
result is the following. 

THEOREM 4.8. Consider a cluster with C < 4 machines. Then any committed scheduler that 
satisfies the no-early processing requirement has an unbounded competitive ratio for s < A/C. 

In view of Theorem 14.41 note that this bound is tight for the single server case (under the no early 
processing requirement). It remains an open question whether removing the no-early processing re¬ 
quirement could lead to bounded competitive ratio for a larger range of s. More generally, obtaining 
tighter lower bounds for multiple servers is a direction that is still unresolved. 

5. TRUTHFUL COMMITTED SCHEDULING 

In this section we construct a scheduling mechanism that is both truthful and committed. As it turns 
out, the reductions presented in the previous section preserve monotonicity with respect to values, 
deadlines, and demands, but not necessarily with respect to arrival times. Therefore, by plugging in 
an existing truthful non-committed scheduler (Section [3}. we can obtain a committed mechanism 
that is truthful assuming all arrival times are publicly known. In Section [531 we show how to modify 
the construction to achieve full truthfulness. 

5.1. Public Arrival Times 

In this subsection, we consider the case where job arrival times are common knowledge, i.e., users 
cannot misreport the arrival times of their jobs. To construct the partially truthful mechanism, we 
apply one of the reductions from committed scheduling to non-committed scheduling (Section 0} 
on a truthful non-committed mechanism, which we denote by At- We denote by A rr the resulting 
mechanism. In the following, we prove that Af c is almost truthful: it is monotone with respect to 
values, deadlines, and demands, but not with respect to arrival times. 

CLAIM 5.1. Let At be a truthful scheduling algorithm, and let Af c be a committed mechanism 
obtained by applying one of the reductions from committed scheduling to non-committed scheduling 
(assume all required preconditions apply). Then, Af c is monotone with respect to values, demands 
and deadlines. 

PROOF. Recall that upon an arrival of a new job j, the job is submitted to At with a virtual type 
of Tj 1 ' 1 = (vj, aDj, a,j, d^) for some constant a > 1 (the constant differs between the reductions 

for a single server and for multiple servers). Also recall that d!'-’' 1 = dj — ui(dj — a.j) is the virtual 
deadline of job j, which is a monotone function of dj. Moreover, A- rc then completes job j on 
input r precisely if At completes job j on input t-’A But since At is monotone, and since Vj, 
aDj, and d!'- 1 ' 1 are appropriately monotone functions of v :j , Dj, and dj (respectively), it follows that 
Af c is monotone with respect to Vj, Dj, and dj. □ 

Hence, the reductions from committed to non-committed scheduling (Theorems 14.41 and 14.7b can 
be extended to guarantee truthfulness (public arrival times), as long as the given (non-committed) 
scheduler is monotone. 
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Recall that the definition of /3-responsiveness for mechanisms requires not only that allocation 
decisions be made sufficiently early, but also that requisite payments be calculated in a timely fash¬ 
ion as well. To obtain a /3-responsive mechanism we must therefore establish that it is possible 
to compute payments at the time of commitment, for each job j. Fortunately, because the time of 
commitment is independent of a job’s reported value, this is straightforward. At the time of commit¬ 
ment, it is possible to determine the lowest value at which the job would have been accepted (i.e., 
scheduled by the simul ator). This critical value is the appropriate payment to guarantee truthfulness 
(see, e.g., jHajiaghavi et al.ll200.5il '). so it can be offered promptly. 

It is important to understand why Af c may give incentive to misreport arrival times. Consider the 
single server case, take w = 1/2, and suppose there are two jobs n = (vi, D i, ai, di) = (1,1,0, 8) 
and 72 = (v 21 D 2 , 0 , 2 , df) = (10,2,0,100). In this instance, job 1 would not be accepted: the 
simulator will process job 2 throughout the interval [0,4] (recall that demands are doubled in the 
simulation), blocking the execution of job 1. Since time 4 is the virtual deadline of job 1 (half of 
its execution window), the job will be rejected at that time. However, if job 1 instead declared an 
arrival time of 4, then the simulator would successfully complete the job by its virtual deadline of 
6 , and the job would be accepted. 

5.2. Full Truthfulness 

In this subsection, we explicitly construct a truthful, committed scheduling mechanism. The issue in 
the last example is that misreporting a later arrival time can lead to a later virtual deadline being used 
by the simulator. This ability to delay the virtual deadline can incentivize non-truthful reporting. 
We address this issue by imposing additional structure on the time intervals used for simulation. 
Given the reported job demand Dj and execution window [dj, dj], we determine a collection of 
subintervals of [a,j,dj] in which to run simulations. If the simulator accepts the job in any of these 
subintervals, we admit the job and process it in the subsequent interval; otherwise the job is rejected. 
We will construct the subintervals in such a way that monotonicity is preserved: declaring a smaller 
execution window or a greater demand can lead only to less desirable simulation windows (i.e., 
subsets of the originals). 

Truthfulness follows from the fact that the simulation parameters cannot be influenced benefi¬ 
cially by the reported arrival and departure times. The main technical challenge is to establish a 
competitive ratio bound for this modified solution; it turns out that the dual-fitting argument used to 
bound the competitive ratio of At in Section [3] can be modified to provide the necessary bounds. 
We end up with the following result. A full proof, and a more formal description of the reduction, 
appears in AppendixiDl 

THEOREM 5.2. There exist constants cq and sq such that, for any s > So, there exists a truthful, 
(2s/ so)-responsive scheduling algorithm Atc such that: 



For the case of multiple identical servers, we obtain constants cq = 94.248 and s 0 = 139.872. 
For the single server case, we obtain Co = 9 and so = 12. 

6. CONCLUSION 

This paper designs and analyzes truthful online scheduling mechanisms. Although the model studied 
herein is clearly a theoretical abstraction of the full complexity faced by scheduling of tasks in the 
cloud, we believe that the principles developed here can carry over to more complex settings. 

The /3-responsive mechanisms described in Section[4]and Section [5Tl actuallv satisfy two stronger 
properties. First, they satisfy an alternate responsiveness property: there exists a constant lo £ (0,1) 
such that the scheduler makes a commitment for each job after a (1—w) fraction of the job execution 
window has passed, i.e., by time dj — oj(dj — aj). Second, they satisfy no early processing, i.e., 
the mechanisms may process jobs only once they have committed to their completion. In contrast. 
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the truthful scheduling mechanism from Section [A2l does not necessarily satisfy the two properties. 
An interesting open question is whether there exists a (fully) truthful scheduling mechanism with 
constant competitive ratio that commits to scheduling each job before a constant fraction of its 
execution window has elapsed. 

The most obvious problem left open by our work is to improve the constants in our results. The 
mechanisms constructed for our most general results involve large constants that can potentially 
be improved. One particularly interesting question along these lines is whether one can obtain an 
approximation factor that approaches 1 as the number of servers C grows large. An additional 
avenue of future work is to extend our results to more sophisticated scheduling problems. One 
might invest igate jobs with parallelism, or jobs made up of many interdependent tasks (see, e.g., 
iBodfk etal.11201411 ). or the impact of non-uniform machines or time-varying capacity, and so on. 
The primary question is then to determine to what extent deadline slackness helps to construct 
constant-competitive mechanisms for variations of the online scheduling problem. 
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Appendices 

A. AN ALTERNATIVE NOTION OF PROMPTNESS 

Recall the definition of /3-responsiveness: a scheduling mechanism is /3-responsive (for /3 > 0) if, 
for every job j, by time dj — /3 • Dj it either (a) rejects the job or, (b) guarantees that the job will be 
completed by its deadline and specifies the required payment. In this section we discuss a different, 
equally natural notion of responsiveness. 

Given ui £ [0,1], we could ask for a scheduling mechanism to make the choice of whether to 
accept or reject each job j by time dj — oj(dj — aj). That is, a decision must be reached for each 
job when a (1 — w) fraction of its execution window has elapsed. The case uj = 1 corresponds to 
allocation decisions being made upon arrival, and w = 0 is equivalent to no commitment. 

We note that the mechanisms constructed in Section Q](non-truthful) and Section [5TT1 (truthful 
when arrival times are public) actually satisfy this alternative notion of responsiveness for a con¬ 
stant to, in addition to being /3-responsive. Indeed, for these mechanisms, /3-responsiveness actually 
follows as a corollary of this alternative form of multiplicative responsiveness, combined with the 
slackness condition. However, the truthful and /3-responsive mechanism from Section [72l does not 
satisfy multiplicative responsiveness for any constant ui. We leave open the problem of designing a 
fully truthful scheduler that makes commitments before a constant faction of each job’s execution 
window has passed. 

B. TRUTHFUL NON-COMMITTED SCHEDULING 
B.l. Non-Truthfulness of iLucier et alj|2013)l 

Recall that the non-committed algorithm by ILucier et al.ll2013tl is based on the following two prop¬ 
erties. First, a running job j can only be preempted by a job j' satisfying p,j> > 7 pj for some 
parameter 7 > 1. Second, if a job j is not allocated by time dj — pDj for some p > 1, it is not 
allocated at all. In the following, we prove that such a scheduler is not truthful. 

Assume the system consists of a single server. Consider four job types: A , B, C and D. Assume 
Pa = 1 , Pb = 7, Pc = 7 2 and Pd = 00 . Specifically, type B jobs cannot preempt type A jobs; 
type C jobs cannot preempt type B jobs; however, type C jobs can preempt type A jobs. We use 
type D jobs to maintain the server busy when needed. Our input consists of one type A job (which 
we simply refer to as A), one type B job (referred as B) and s type C jobs. We construct an instance 
such that A is not completed due to type C jobs. However, by decreasing the value of A, job B 
blocks the type C jobs from running. This allows A to complete. 

Set Da = 2 p and dA = s + p. Set as = 0.5 p and Db = s — 0.5p. Finally, set ac = p, 
dc = s + p and Dc = 1- Assume all other parameters are set such every job j satisfies Vj = PjDj 
and dj — aj = sDj. Type D jobs are set such that the server is busy until time t — 0. 

Case 1 - pa = 1. 

t < 0 The algorithm processes type D jobs. 

t = 0 The algorithm begins to process A. 

t = 0.5 p Job B arrives. The algorithm decides not to preempt A. 

t = p All type C jobs arrive. Job A is preempted. 

The algorithm processes s type C jobs until time t = s + p. 
t = s + p The type C jobs are all processed, but A is not completed by its deadline. 

Case 2 - pa <1. 

B.2. Single Server 

We prove the truthfulness of the non-committed single server algorithm. 
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t < 0 
t = 0 
t = 0.5/it 

t = /X 

t = s 
t = s + n 


The algorithm processes type D jobs. 

The algorithm begins to process A. 

Job B arrives. The algorithm preempts A and begins to process B. 

All type C jobs arrive. Job B is not preempted. 

The algorithm completes B. The type C were not allocated by dc — pDc = s. 
Hence, all type C jobs are rejected. The algorithm resumes processing A. 

The algorithm completes A by its deadline. 


Proof of Theorem I3.4I By Theorem l2.ll it suffices to show that At is monotone. Consider 
some job j. Throughout the proof, we fix the types r_ ; of all jobs beside j. To ease exposition, we 
drop T_j from our notation. Write t 3 = {Vj,Dj , aj, dj) for the true type of job j. Suppose that 
At (t 3 . T_j ) completes job j. We must show that At will still complete job j under a reported type 
of tL where r' >- t 3 . Since one can modify each component of a reported type in sequence, it will 
suffice to establish monotonicity with respect to each coordinate independently. 

Step 1: Value Monotonicity. Let us first establish value monotonicity. Consider some u' > Vj 
and assume j is completed when reporting v 3 . Let p' = v'j/Dj be the value-density and let £' = 
[log 7 (p')J be the class of job j when reporting u'. Let st'(y 3 ) be the starting time of j when 
reporting ?/•. Similarly, denote p 3 , l 3 and st(y 3 ) with respect to v 3 . Notice that if l 3 = l'j, then the 
behavior of At is identical regardless of value reported; hence j is completed. We therefore assume 
i' 3 > tj. Note then that st'{y 3 ) < st(y 3 ), since if j does not start by time st(y 3 ) under reported 
value v' 3 , then it will start at that time since it has only a higher class. 

Now, consider the case where j reports a value of v'. Observe J p (t) at time t = st'(y 3 ). Claims 
13.21 and [XH implv that no existing job can preempt job j. Specifically, any job that can run instead 
of j during the interval [st'(yj), d 3 \ must have arrived after time st'(y 3 ). If j did not complete, then 
during this interval the algorithm processed higher priority jobs during more than dj — st'iVj) - D 3 
time units. These jobs would also be preferred by At when j reports a lower value of v 3 . JJence, j 
could not have been completed when the value Vj was reported, a contradiction. 

Step 2: Monotonicity of Other Properties. Next consider misreporting a demand Dj < Dj, and 
suppose the job completes under report Dj. Then the job’s value density is higher under report Dj, 
and its latest possible starting time is increased to dj — // /J'. This only extends the possibilities 
of j being completed, and hence job j would be completed under report Dj as well. Following 
similar arguments, a later deadline <■/' instead of dj only increases the latest possible start time of j, 
and increases the time slots in which the job can be completed, which again can only increase the 
allocation to j. Finally, we show that a later arrival time a' < a 3 cannot be detrimental. We assume 
that j is completed when the job is submitted at time aj. It remains to prove that j is completed when 
submitting at time a'. This follows in a similar fashion to our argument for value monotonicity. If 
when reporting a' the job is not processed until a 3 , then both executions of At are identical, hence 
j is completed. Otherwise, j necessarily begins earlier than when a 3 is reported, and again job j is 
completed. We reach the same conclusion as before. 

Since At satisfies all required monotonicity conditions, we conclude that At is truthful. □ 


We now bound the competitive ratio of the truthful non-committed algorithm for a single server. 

Proof of Theorem 13.51 We bound the competitive ratio of At for the single server case. 
Our proof strongly relies on the original analysis of the non-committed scheduling algorithm by 
iLucier et al.ll2013li . Consider an execution of At on an instance r. Recall that Claim [3T2l states that 
at each time t, no job in J p (t ) or J E (t) can have a value density larger than 7 p^ T ( t). Furthermore, 
the algorithm does not allocate resources to any jot^ j that has not been allocated by time dj — pDj. 
For schedulers satisfying these two properties, itLucier et al.ll2013li proved that there exists a feasible 
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solution for the dual program corresponding to r, with a total dual cost of: 

OO 

v{A t {t)) + 7- ■ [ pa t (t)dt. (11) 

s — p J 
0 

It remains to bound the integral. Let T p denote the set of times during which completed jobs were 
processed, and denote by T p the remaining times. Notice that all jobs processed during T p are 
partially processed jobs. Hence, the integral can be written as v(At(t)) + J.j- P pA T {t)dt- The latter 
expression represents the total value corresponding to partial work lost from not completing jobs. 
That is, if the algorithm processed half of some job j, then it lost a partial value of 0. ov-j. 

Several useful properties of the original non-truthful algorithm are preserved in our truthful vari¬ 
ant. We prove here that At preserves these properties and show how they can be used to bound 
the lost partial value. Consider a partially processed job j. Let j' be any job other than j running 
during some time t £ [st(yj),dj]. We claim that st(yj) < st(yj'). Assume the contrary. Since j 
starts processing at time st(yj), this implies that 3 /■ However, we know that j has never been 

completed. By Claim [72l it is impossible that j' was processed at time t. Therefore, the claim holds. 
Moreover, it follows that j' >- j, since j' started being processed after time st{yj). 

Consider again the interval [st(yj), dj). Notice that the length of the interval is at least pDj. Our 
previous claims imply that during this interval the algorithm processed jobs that belong to higher 
classes than j for at least (p — 1 )Dj of the time. This translates to a value of at least (p — 1 )vj, 
since the value density of these jobs are at least pj. Intuitively, one would wish that this value 
could account for the loss of j. However, jobs processed inside the interval \st(y.j ), d ,-] have not 
necessarily been completed. This calls for a more rigorous analysis. jLucier et al.ll2013ll introduced 
a complex charging argument to bound the lost partial value. By slightly modifying their proof], we 
can obtain the following bound: 

OO 

J pA T (t) dt < v(At(t))- 
0 

By combining ( 1111 and (fl2l > we can then apply the dual fitting theorem (Theorem l2.3l l and get: 

c L4t( s ) <1+7- 

For every p, the above bound is optimized for a unique value 7* (p) = By choosing p « s 2 / 3 

we obtain the bound stated in the theorem. □ 

B.3. Multiple Servers 

We extend At to accommodate multiple servers. In the multiple server variant, which we also 
denote by At, each server runs a local copy of the single server algorithm. Specifically, if a job j 
has not been executed on a server i during the interval \a :i , dj — pDj ], then the algorithm prevents it 
from running on server i. The detailed implementation of the algorithm is given fully in Algorithmic 
The algorithm follows the general approach described here in an efficient manner. Upon arrival of a 
job j at time t, we only invoke the class preemption rule on server v' rnm (t), which is the server running 
the job belonging to the lowest class (unused servers run idle jobs of class — 00 ). Ties are broken 
in favor of the job with the later start time in the system. This is crucial for proving truthfulness. 
Notice also that it suffices to invoke the class preemption rule of server * m in(f) : if job j is rejected, it 


1 + 


( 7 - l)(/r- 1 ) - 1 


(13) 


1 + 


(7 — 1)(M — !) - 1 


( 12 ) 


6 Our analysis differs since the truthful algorithm preempts jobs according to class. Consider two jobs j,j r that belong to 
classes £, i ', respectively. Notice th at if i' — t — i > 0 then pj/ < 7 1—1 pj . This bound is weaker by a factor of 7 compared 
to an equivalent bound obtained by tLucier et al imi . which increases the bound on the lost partial value by 7 . 
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would be rejected by the class preemption rule of any other server. When job j completes on server 
i, we first load the job with maximal value-density out of the jobs preempted from server i, and then 
invoke the class preemption rule. Notice that the class preemption rule allows preempted jobs to 
migrate between servers. That is, a job preempted from server i might start executing on a different 
server at time t , provided that t < dj — pDj. 


ALGORITHM 2: Truthful Non-Committed Algorithm At for Multiple Servers 
Vf, Ji (t) = { j E J | j partially processed on server i at time t A t 6 [%, dj]}. 
jf (t ) = { j E J | j unallocated on server i at time t A t € [ay , dj — pDj] }. 

= {j} t(i) | 1 < * < C}. (jobs executing at time t) 

Event: On arrival of job j at time t = ay: 

1. call ClassPreemptionRule(j m m(f), f), where: 

*min(f) = argminj [log 7 p\{t )J | 1 < * < C } (ties broken according to later start time) 

Event: On completion of job j on server i at time t: 

1. resume execution of job j' = argmax{py/ | j' E jf (t)}. 

2. call ClassPreemptionRule(i,f). 

3. delay the output response of j until time dj. 

ClassPreemptionRule(i,f): 

1 . j <— job currently being processed on server i. 

2. j* j— argmax { pj * | j € J^(t) \ (ties broken by earlier start time) 

3. if (fyj) 

3.1. preempt j and run j*. 


We argue that the proposed mechanism is truthful. Note that claims [3721 and [331 apply on each 
server separately. However, this is insufficient for proving truthfulness. Instead, we prove the fol¬ 
lowing useful claim. Recall that yj it) indicates whether job j was allocated on server i at time t. 
Define the starting point st(yj) = min {{t | y* (t) = 1} U {oo}} of job j as the first point in time 
at which j is allocated. If no such t exists, st(yj) = oo. 

CLAIM B.l. Let j = j\ {t) be the job processed on ser\’er i at time t by At- Let j' be any 
job not running at time t, and assume j' is either an allocated job such that t £ [ay/, dj>] or an 
unallocated job such that t £ [ay/, dy/ — /iZIy], Let Cg, Cg > denote the classes of j, j', respectively. 
Then, either j > i' or t = l' A sf(yy) < sf(yy/). 

PROOF. Since each server runs a local copy of the single server algorithm, Claim COl imnlics that 
j' f- j, therefore (. > t'. It remains to prove that if t = t' then sf(yy) < sf(yy/). Assume towards 
contradiction that sf(yy') < sf(yy) (equality is impossible, since we assume j is running at time t 
and j' is not). This implies that the algorithm always prioritizes j' over j. Notice that at time st (yy) 
job j' must be running; otherwise, the algorithm would have not started processing job j. Therefore, 
at time st {yf) both jobs are running, and at time t only job j is running. We show that this scenario is 
impossible. Notice that j' cannot be preempted while j is running, since the algorithm would choose 
to preempt j instead. Hence, sometime during the interval [sf(yy),dy] bothjobs were preempted and 
j resumed execution. This is impossible, since j' would have been resumed instead of j. We reach 
a contradiction. Therefore, the claim holds. □ 

The claim implies that at every time t the algorithm is processing the C top available jobs, where 
the jobs are ordered first by their class (high to low), and in case of equality ordered by their start 
times (low to high). This observation is essential for proving truthfulness. 
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THEOREM B.2. The algorithm At for multiple servers is truthful. 

Proof. The proof follows directly from the equivalent single server proof. Consider some job j 
and two value-densities p' < p". Let (!, i' denote the corresponding classes and let st'(yj ), st"(jjj) 
denote the corresponding start times. Notice that £’ < l". We prove that also st"{yf) < st'(y-j). 
Consider the case where j has a value-density of p". If j is processed before time stfyf), the claim 
holds. Otherwise, the behavior of the algorithm up to time st'fyf) is identical in both cases. Since 
now j has a higher value density, it will also begin processing. We conclude that increasing the 
value-density only increases the priority of j, with respect to the algorithm At- Therefore, we can 
repeat the arguments that lead to prove the truthfulness of the single server algorithm. □ 

We conclude by proving the bound on the competitive ratio stated in Theorem l3.6l 


Proof of Theorem [376] Similar to the single server case, we can show that for every time t 
a running job j and a pending job j' satisfy pj> < 7p, . Furthermore, each server does not begin 
processing any job j after time dj — p Dj . iLucieretah [201 3ll proved that in this case, there exists 
a feasible solution for the dual program corresponding to r, with a total dual cost of: 


1 + 7 • 



c 00 

J PA T (t) dt 

i=l n 


(14) 


We can bound the integral J 0 °° p\ T ( t)dt for each server i individually, as done for the single server 
case. Summing over all servers, we get the following bound on the competitive ratio of At- 


cx(At) < 


- 7 ‘ 


s — p 


1 + 


(7- i)(m- 1 ) - 1 . 


(15) 


By setting 7 = - 7=37 and p ks s 2 / 3 we obtain the bound stated in the theorem. □ 


C. LOWER BOUND ON COMMITTED SCHEDULING 

In the following section we prove Theorem 14.81 We first prove that no single server committed 
scheduler can provide any constant competitive ratio for s < 4. We then generalize our bound for 
C < 3 servers, and prove an impossibility result for s < 4/(7. 

THEOREM C.l. In the single server model, any online algorithm that commits to jobs on ad¬ 
mission has an unbounded competitive ratio for s < 4. 

To prove Theorem 1C. II we describe the following adversarial strategy. The adversary sets the 
value of each arriving job to be significantly larger than the sum of all previous jobs, and waits for 
the job to be accepted before submitting a new job. This forces the algorithm to admit all arriv¬ 
ing jobs, otherwise the algorithm would not maintain a constant competitive ratio. In addition, all 
jobs share the same deadline. We first make a simplifying assumption on the scheduling algorithm, 
which we later relax. Given that all deadlines are identical, it is natural to assume that the schedul¬ 
ing algorithm does not admit a job before completing all previous commitments. We call such an 
algorithm natural. 

LEMMA C.2. In the single server model, any online natural algorithm that commits to jobs on 
admission has an unbounded competitive ratio for s < 4. 

PROOF. First note that in order to prove the lower bound, it is enough to consider work preserv¬ 
ing algorithms. An algorithm is considered work preserving if the algorithm does not remain idle 
if it has unmet commitments. We can assume this since every algorithm can be transformed into a 
work preserving algorithm and perform at least as good as the original algorithm for any input. 
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Assume towards contradiction that there is a natural algorithm with a bounded competitive ratio 
of c > 1. Denote by ji,j 2 > ■ ■ ■ the jobs submitted in order of their submission. We construct an 
adversarial strategy subject to the following invariants. 

INVARIANT C.3. Every arriving job j has a deadline dj = s and a demand Dj = s a f which 
is the largest possible demand , with respect to the slackness constraint. 

INVARIANT C.4. A new job arrives immediately when the previous job is admitted. Formally, 
let t n denote the admission time of job j n . Then, a n +i = t n . 

Recall that each job is associated with a type r 7 = (yj,Dj, a,j, dj). The first job j\ has a type 
(1,1,0, s). As long as the algorithm does not accept the job, the adversary does not submit any 
additional jobs, as stated in Invariant lC.4l Eventually, the algorithm must accept job j\, otherwise 
the competitive ratio will be unbound. At time t\, the adversary submits the next job j-j with a type 
(c +1, fi, s). Specifically, the value of 72 is significantly higher than j 1 , and the demand is set 
according to 1C. 31 Job 72 must be accepted, to maintain the guaranteed competitive ratio. We now 
submit job 73 and so forth. 

In general, the demand of each job j n is set as D n = s ~ tn ~ 1 , in accordance to Invariant |C.3l The 
value of each job j n is set as v n = (c + l) n_1 . Note that this value is at least c times larger than the 
sum of all previous job values. The adversary continues this strategy until the algorithm is forced to 
commit a job which cannot be completed by its deadline among with its previous commitments. 

Define £ n +i as the time between the completion of job j n and the admission of job j n + i ■ For¬ 
mally, we can write £ n +i = t n +1 — ( t n +D n ), since the algorithm executes j n starting time t n with¬ 
out interruption. Note that for natural algorithms, £ n +i > 0, hence t n +i > t n + D n . Let w(t) denote 
the total remaining unprocessed demand of admitted jobs at time t, that is, the total time needed for 
the scheduler to meet all remaining commitments. Correspondingly, let /(f) = (s — f) — w(t) de¬ 
note the available free time before the common deadline s. By the assumption that the scheduling 
algorithm is natural, it holds that w(t n ) = D n and therefore /(f n ) = (s — t n ) — D n . By combining 
the last equation with the definition of Dj and the bound on t n , we get that: 

s-t n+ 1 < /(f n ) = s — t n — D n = s — f„ — -—L (16) 

s 

Define A n = s t n for every n. Intuitively, A n represents the free time immediately before the 
admission of job j n . Equation (fT6b can be written as: 

A„+i < A n - ^IFZl (17) 

s 

Next, we prove that if s < 4 then A„ becomes negative, and therefore so does f(t n ) by definition. 
Assume towards contradiction that for every n we have A r , > 0. Define y n = ^" 1 . Notice that 

y n > 1 for every n, since A„ is monotonically decreasing for n by definition. By dividing (fT 71 > 
by A ra and using y n , we get that < 1 — Vn f 1 . By using the known inequality 1 — ^ < j for 

every a > 0, we get that A- < 4 e ^ or alternatively, > i > 1 (since s < 4). Hence, when 
n —i 00 we get that y n —> 00 . However, by m we have: 0 < A n +i < A„ — 1 and therefore 

= y n < s, which is a contradiction. □ 

Until now, we have only considered natural algorithms. Notice that for general scheduling 
algorithms, some of the values £ n might be negative. To overcome this difficulty, we modify the 
strategy of the adversary. 

Proof of Theorem 1C. 11 Assume towards contradiction there is a (general) algorithm that 
guarantees a competitive ratio of c > 1. As in the proof of Lemma ICAl it is enough to consider 
work preserving algorithms. We slightly modify the adversary described in Lemma ICAl to handle 
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cases where i n < 0. Let n be the first job for which l n < 0. This means that the algorithm admits 
j n before all previous commitments have been met. Recall that j n arrives at time t n -i, when job 
jn-i is admitted. Hence, job j n is admitted sometime during the execution of j n -i, since by our 
choice of n, all previous jobs j\ ... j n -i have been completed. The adversary does the following. 
Instead of submitting job j n +1 immediately at time t n (as stated in Invariant 1C.4b . the adversary 
waits \i n \ time before submitting j n +i- Let a' n+1 = t n + max{0,— £ n } denote the new arrival 
time of job j n +i ■ Notice that a' n+1 corresponds to the arrival time a„+i of j n +i if (■n would have 
been 0. We claim that by waiting \£ n \ time, the adversary sees the same setting at time a' n+1 as he 
would for a natural algorithm with all previous values l\,..., t n -\ > 0 and l n = 0. This follows 
since we assume the algorithm is work preserving, thus w(a ' n+1 ) = D n , as it would for the case 
where £ n = 0. Specifically, f(a! n+1 ) = s — a' n+1 — D n , as before. We repeat the same correcting 
procedure for every succeeding job j n for which &n ^ 0* if such job exists. Notice that since / 
is monotonically non-increasing, if f(t n ) < 0 for some job j n then f{a n + 1 ) = f(a ' n+1 ) < 0. 
Therefore, the algorithm is guaranteed to fail, as in Lemma |C21 □ 

We now generalize our impossibility result for 1 < C < 3 servers. 

PROOF OF Theorem I 4.81 Assume towards contradiction that there exists a committed algorithm 
A c for C servers with a bounded competitive ratio for s < 4/C. We can construct a single server 
algorithm A 1 2 3 for s < 4 with bounded competitive ratio, contradicting Theorem lC.il To do so, we 
translate every time unit for the C server algorithm A c to C consecutive time slots for A 1 . □ 

D. TRUTHFUL COMMITTED SCHEDULING 

In Section l5TTI we described a committed scheduler that is truthful with respect to values, deadlines, 
and demands, but not necessarily with respect to arrival time. In this section we show how to extend 
our construction to be fully truthful with respect to all parameters. For ease of readability we will 
drop the parametrization with respect to oj, and simply set oj = i in all invocations of earlier results. 

Recall that our method for building responsive schedulers is to split each job’s execution window 
into a simulation phase and an execution phase. As discussed in Section 15.11 the reason that the 
scheduler from Section O is not truthful with respect to arrival time is that a job may benefit by 
influencing the time interval in which the simulation phase is executed. By declaring a later arrival, 
a job may shift the simulation to a later, less-congested time, increasing the likelihood that the 
simulator accepts the job. 

Our strategy for addressing this issue is to impose additional structure on the timing of simula¬ 
tions. Roughly speaking, we will imagining partitioning (part of) each job’s execution window into 
many sub-intervals. A simulation will be run for each subinterval, and the job will be admitted if 
any of these simulations are successful. Our method for selecting these simulation intervals will 
be monotone: reporting a smaller execution window or a larger job can only result in smaller sim¬ 
ulation intervals. Using the truthful scheduling algorithm from Section [3] as a simulator will then 
result in an overall truthful scheduler. The competitive ratio analysis will follow by extending our 
dual-fitting technique to allow multiple simulations for a single job. 

Defining Simulation Intervals. Our method of choosing sub-intervals will be as follows. Choose 
a parameter a > 1 to be fixed later; a will determine a minimal slackness constraint for our simu¬ 
lations. Given slackness parameter s and a job t 3 = (v 3 , D 3 . a 3 . d 3 ) , let k 3 be the minimal integer 
such that 2 kj > 2<r l) 3 . The value 2 kj will be the minimal length of a simulation window. Simulation 
intervals will have lengths that are powers of 2, and endpoints aligned to consecutive powers of 2. 

We say an interval [a, h] is aligned for job j if: 

(1) [a, b] C [, a,j,dj ], 

(2) [b , b + (b — a)] C [aj,dj], and 

(3) a = t ■ 2 k and b = (t + 1) • 2 k for some integers t > 0 and k > k 3 . 
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Write Cj for the collection of maximal aligned intervals for job j, where maximality is 
with respect to set inclusion. For example, if kj = 2 and [aj, dj] = [9,50], then Cj = 
{[12,16], [16,32], [32,40], [40,44]}. Note that [16,20] is not in Cj because it is not maximal: it 
is contained in [16,32]. Also, [32,48] is not in Cj because it is not aligned for job j: the second 
condition of alignment is not satisfied, since [48,64] % [9, 50]. 

We refer to Cj as the simulation intervals for job j; it is precisely the set of intervals on which the 
execution of job j will be simulated. We now make a few observations about simulation intervals. 

PROPOSITION D.l. If a < s/12 then Cj is non-empty. 

Proof. We prove the contrapositive. If Cj is empty, then there is no subinterval of [aj , dj ] of the 
form [t ■ 2 kj , (t + 2) • 2 kj [. It must therefore be that [a ? . dj ] is contained in an interval of the form 
(t ■ 2 kj ,(t + 3) • 2 kj ). Thus (■dj — aj) < 3 • 2 kj . From the definition of kj, we have 2 kj < AaDj, 
and hence (dj — a j) < 12aDj. Since job j has slackness s, we conclude a > s/12. □ 

PROPOSITION D.2. IfCj is non-empty then Cj is a disjoint partition of an inten’al I C [aj , dj\, 
with |/| > \{dj — aj). 

PROOF. Disjointness follows because the intervals in Cj are aligned to powers of 2 and are 
maximal. That their union forms an interval follows from the fact that, for each k, the aligned 
intervals of length 2 k together form a contiguous interval. It remains to bound the length of the 
interval I. 

Choose k such that the maximal-length interval in Cj has length 2 k . Choose t\ and t 2 so that 
aj <E ((fi - l)2 fe ,fi2 fc ] and dj G [t 2 2 fc , (t 2 + l)2 fc ). Then (dj - aj) < (t 2 - ti + 2) ■ 2 k . Also, 
since Cj contains an interval of length 2 k , we must have (t 2 — fi) > 2. Moreover, each interval 
[t2 k , (t + l)2 fe ] with <i < t < t 2 — 1 is aligned for job j, and hence |/| > (t 2 — t-\ — l)2 fc . We 
conclude 

\I\ > (*2 - t! - l)2 fc > (dj aj) ■ > (dj aj) ■ \ 

where in the last inequality we used (t 2 — t\) >2. □ 

The Scheduling Mechanism: Single Server. We now describe our truthful committed scheduler, 
denoted Arc- We begin by describing the construction for the single-server case. The main idea is 
a straightforward extension of the simulation methodology described in Section |4] For each job j 
that arrives with declared type t 3 = (Vj,Dj , aj,dj), and for each subinterval /fZ If 1 } £ Cj, we 

will create a new phantom job Tp = (vj,2Dj, aZZ b^)- We will then employ the online, truthful, 
non-committed scheduling algorithm Ap from Section [2 using these phantom jobs as input. If a 
phantom job t - completes, then all subsequent phantoms for the corresponding job j are removed 
from the input, and job j is subsequently processed on the “real” server (using an EDF scheduler). 
That is, a job is admitted if any of its phantom jobs complete; otherwise, if none of its phantoms 
completes, then it is rejected. Note that since the phantom jobs have disjoint execution windows, 
it is known whether a given phantom completes before any subsequent phantom jobs arrive, and 
hence phantom jobs can be “removed” in an online fashion. 

Theorem D.3. Choose a > 1 and suppose s > 12 a. Then the scheduler Arc described 
above is 2a-responsive, truthful, and has competitive ratio bounded by 

cr A TC (s) < 8 • cr A (a). 

We prove each property of the theorem in turn. To establish responsiveness, note that the sched¬ 
uler will always commit to executing a job j by the end of the last interval in Cj. Since each aligned 
interval has length at most 2 kj , and since an aligned interval of length t must end before time dj — i 


EC’ 15, June 15-19, 2015, Portland, OR, USA, Vol. X, No. X, Article X, Publication date: February 2015. 



X:28 


Y. Azar et al. 


(condition 2 in the definition of aligned intervals), this endpoint occurs at least 2 kj > 2 crDj time 
units before dj. This implies that the scheduler is 2er-responsive. 

We next bound the competitive ratio of the modified scheduler. 

CLAIM D.4. The competitive ratio of the scheduler Atc described above is at most 

cr a tc (s) < 8 • crA r (a). 

PROOF. Consider an input instance r with slackness s. Let r <r) denote the following “phantom” 
input instance: for each job j in r we include all phantom jobs up to and including the first phantom 
accepted by A, but not those that follow. Note then that running At on inputs A v) generates the 
same total value as running Atc on input instance r. Also note that the slackness of the phantom 
input instance is at least cr. 

We prove the claim by constructing a feasible dual solution (a, (3) satisfying © and bounding its 
total cost. Let (a*, f3*) denote the optimal fractional solution of the dual program corresponding to 
r' v f We assume A induces an upper bound on the integrality gap for slackness cr. Therefore, the 
dual cost of (a *, f3 *) is at most cr _4 (cr) • v (At (r 1 ^ )) = cr _4 (cr) • v ( Atc (t) ) . 

The claim follows by applying the resizing lemma and the stretching lemma to (a*, 13*). First, we 
apply the resizing lemma for / = 2, as each phantom corresponding to job j has demand at most 
2 Dj. This increases the dual cost by a multiplicative factor of 2. Second, we apply the stretching 
lemma to all of the phantom jobs corresponding to job j, so that their execution windows remain 
disjoint and contiguous, their last deadline becomes dj, and their earliest arrival time becomes a,j. 
By Pronosition lD.21 this involves invoking the stretching lemma with / = 4. Denote by (a 1 , ff ) the 
resulting resized and stretched dual solution. Finally, for each job j we take a 3 to be the maximum 
of the entries of a' corresponding to phantoms of j, and we take /3 = /3'. 

After applying both lemmas, we obtain a feasible dual solution that satisfies the dual constraints 
©. The dual cost of the solution is at most: 

8 • cr Ar (cr) • v(A C t(t)) 

and therefore by applying the dual fitting theorem (Theorem l2.31 > we obtain our desired result. □ 

Finally, we argue that the resulting mechanism is truthful. 

Claim D.5. Scheduler Atc I s truthful, with respect to job parameters ( Vj,Dj,aj,dj). 

Proof. Consider a job j and fix the reports of other jobs. Consider two types for job j, say t 3 
and r'j, with t 3 dominating r'. Let C 3 and C' denote the sets of simulation intervals under reports 
Tj and Tj, respectively. We claim that for every interval I' G C' there exists some I G C 3 such that 

r c i. 

Before proving the claim, let us show how it implies Arc is truthful. Recall from the definition 
of At that a job is successfully scheduled in the simulator if the set of times in which a higher- 
priority job is being run satisfies a certain downward-closed condition. Moreover, the times in which 
higher-priority jobs are run is independent of the reported properties of lower-priority jobs, including 
all phantoms of job j. Thus, a job j is accepted if and only if there is some I G C 3 for which 
the corresponding phantom would complete in the simulator, and this is independent of the other 
intervals in C 3 . (Note that this independence is the only point in the argument where we use the 
specific properties of algorithm At, beyond truthfulness.) But now reporting rj dominated by t 3 
can only result in smaller simulation intervals (in the sense of set inclusion), which can only result 
in lower acceptance chance for any given simulation interval by the truthfulness of At- Thus, if job 
j is not accepted under type t 3 , it would also not be accepted under type r'. 

It remains to prove the claim about C' and C 3 . It suffices to consider changes to each parameter 
of job j separately. Changing the value v 3 has no impact on the simulation intervals. Increasing the 
demand Dj can only raise kj, which can only serve to exclude some intervals from being aligned. 
Likewise, increasing ctj or decreasing dj can also only exclude some intervals from being aligned. 
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But if the set of aligned intervals is reduced, and some interval [a, 6] lies in C[ but not in C t , then it 
must be that [a, b] is an aligned interval under reports Tj and rj, but is not maximal under report r 7 . 
In other words, there must be some [a', b'] £ C t such that [a, b] C [a/, b'], as required. □ 

Extending to Multiple Servers. We can extend our construction to multiple identical servers in 
precisely the same manner as in Theorem 14.71 Specifically, when generating phantom jobs, we 
increase their demand by an additional factor of 11.656. As in Theorem l4.7l this allows us to argue 
that the simulated migratory schedule implies the existence of a non-migratory schedule of shorter 
phantom jobs, which in turn implies that passing accepted jobs to a global EDF scheduler results in 
a feasible schedule. We obtain the following result. 

Theorem D.6. Choose a > 1 and suppose s > (12 ■ 11.656) ■ a. Then the scheduler Atc 
described above is 2a-responsive, truthful, and has competitive ratio bounded by 

Cr +rc( s ) < (8 • 11-656) • cr .4 (a). 


E. OBTAINING THEOREM STATEMENTS FROM SECTION 1.1 

The body of the paper describes the general results we obtain for truthful committed scheduling. 
In this appendix, we state the specific results we o btain b y invoking these reductions on specific 
schedulers. Specifically, the non-truthful scheduler iLucier et al.l 1201311 and the truthful scheduler 
developed in Section [3] In each case, constant bounds can be obtained by plugging the algorithms 
directly. However, we can improve the constants by via a more careful analysis, using the dual-fitting 
analysis from the original algorithms. 

For the non-truthful scheduling algorithm A from iFucier et al.ll2013li , the competitive ratio is 
bounded by explicitly constructing a feasible dual solution (a, j3) and bounding its dual cost. The 
following bounds were obtained: 


3 

C 00 

yz J Pi(f) dt = v ( a ( t )) 


1 + 0 


1 + 0 


1 


i 


+ © 




(18) 

(19) 


The same bounds are obtained for the truthful algorithm At (from Section [3), however the power 
in the last asymptotic bound is 3 instead of 2. 

When analyzing the competitive ratio of reductions for committed scheduling (Theorem 14.41 
Corollary 14.61 Theorem 14.71 and Theorem 15.21 ). in each case we apply the resizing lemma and 
the stretching lemma on the dual solution (a,/3). However, the constant blowup from application 
of these lemmas only affects the j3 term. Accounting for this leads to improved constants in the 
resulting competitive ratios. __ 

For example, applying Corollary 14.61 to the algorithm A from iFucier et al.ll2Q13li and setting 
oj = 1/2, one obtains a factor 4 blowup. Applying this blowup only to the /3 term in the dual-fitting 
analysis of A, one obtains a final competitive ratio of 


1 + 0 
= 5 + 0 
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Vs- 1 )_ 
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^ 5-1 


0 


1 + 0 
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(^i-i)VJ 


(^i-1) 2 


yielding the result described in Section fFTI as Theorem ll.3l 

As another example, applying Theorem ID. 3 1 to the algorithm At from Section 0 yields a fac¬ 
tor 8 blowup. Applying this blowup only to the /3 term in the dual-fitting analysis, one obtains a 
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competitive ratio of 


1 + 0 



= 9 + 0 



+ 0 


. 1 + 0 (^5-l) +e ((^i-l)3) 

(^i-1) 3 )' 


The extension to multiple servers follows the same approach as Theorem l4.7l requiring an additional 
application of the resizing lemma with a factor / « 11.656. Again applying this only to the fi term, 
this increases the constant portion of the competitive ratio to 1 + 8 • 11.656 « 94.248, and increases 
the slackness requirement by an additional factor of 11.656. This yields the result described in 
Section lrTl as Theorem 11.41 and restated as Theorem [+2] 
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